A New Method for Applicant of Explicit Semantic Analysis and Word Sense Disambiguation in Concept-based Information Retrieval
Abstract
Earlier information retrieval (IR) systems rely on keywords to index and retrieve documents. They may return inaccurate results when the documents and the users' queries use different keywords to express the same concept. Concept-based retrieval methods try to address this problem by comparing documents and queries at the concept level. Accurate concept extraction from documents and queries therefore improves the performance of IR systems. In this research, we introduce a new concept-based approach to the semantic analysis of queries, built on Wikipedia-based Explicit Semantic Analysis. We propose first to determine the context of the query words by disambiguating their senses with a Wikipedia-based concept network named wikinet. Then, relying on that context, we generate the concepts related to the query, which are compared with the concepts of the documents. Because the main aim of this paper is to provide a correct interpretation and semantic relatedness analysis of query words, we evaluate the method by the correlation of its computed relatedness scores with human judgments. The evaluation shows that the proposed method improves on existing semantic analysis methods.
Keywords: concept-based information retrieval, query, query sense disambiguation,
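As a rough illustration of the Explicit Semantic Analysis idea the abstract builds on, the sketch below maps each text to a weighted vector over concepts and scores relatedness by cosine similarity. It is a minimal sketch under stated assumptions, not the authors' method: the three toy concept articles stand in for Wikipedia, the smoothed TF-IDF weighting is one common choice, all function names are hypothetical, and the wikinet-based sense disambiguation step is not modeled.

```python
import math
from collections import Counter

# Hypothetical toy "concept articles" standing in for Wikipedia pages.
CONCEPTS = {
    "Bank (finance)": "bank money loan deposit account interest money",
    "River": "river water bank shore stream flow water",
    "Loan": "loan money interest credit debt bank",
}


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_index(concepts):
    """Build an inverted index: word -> {concept name: TF-IDF weight}."""
    n = len(concepts)
    tf = {name: Counter(tokenize(body)) for name, body in concepts.items()}
    # Document frequency: in how many concept articles each word occurs.
    df = Counter(word for counts in tf.values() for word in counts)
    index = {}
    for name, counts in tf.items():
        for word, count in counts.items():
            # Assumed smoothing: +1 keeps words shared by all concepts nonzero.
            idf = math.log(n / df[word]) + 1.0
            index.setdefault(word, {})[name] = count * idf
    return index


def concept_vector(text, index):
    """Sum the concept weights of every known word in the text."""
    vec = Counter()
    for word in tokenize(text):
        for concept, weight in index.get(word, {}).items():
            vec[concept] += weight
    return vec


def cosine(u, v):
    """Cosine similarity of two sparse vectors; 0.0 if either is empty."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def relatedness(text_a, text_b, index):
    """Relatedness of two texts in concept space."""
    return cosine(concept_vector(text_a, index), concept_vector(text_b, index))


INDEX = build_index(CONCEPTS)
```

With this toy index, `relatedness("money loan", "credit debt", INDEX)` is positive even though the two texts share no words, because both map onto the "Loan" concept, while `relatedness("money loan", "stream flow", INDEX)` is zero. An evaluation like the one the abstract describes would compare such scores against human relatedness ratings over a set of pairs.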
Similar resources
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of the English language in which nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
Ontology Based Query Expansion Using Word Sense Disambiguation
The existing information retrieval techniques do not consider the context of the keywords present in the user’s queries. Therefore, the search engines sometimes do not provide sufficient information to the users. New methods based on the semantics of user keywords must be developed to search in the vast web space without incurring loss of information. The semantic based information retrieval te...
Investigating the role of homograph context types in determining the similarity between documents
Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation and thereby improving the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts could be divided to different types, wit...
Distributional Semantics Approach to Thai Word Sense Disambiguation
Word sense disambiguation is one of the most important open problems in natural language processing applications such as information retrieval and machine translation. Many strategies can be employed to resolve word ambiguity with a reasonable degree of accuracy. These strategies are knowledge-based, corpus-based, and hybrid-based. This paper pays attention to the corpus-based strategy...
A method for ontology-based semantic relatedness measurement
There are many methods having different approaches for assessing similarity and relatedness and they are used in many application areas, including web service discovery, invocation and composition, word sense disambiguation, information retrieval, ontology alignment and merging, document clustering, and short answer grading. These methods can be categorized as path-based, information content-ba...
Publication date: 2012